An evaluation of resampling methods for assessment of survival risk prediction in high-dimensional settings.
نویسندگان
چکیده
Resampling techniques are often used to provide an initial assessment of accuracy for prognostic prediction models developed using high-dimensional genomic data with binary outcomes. Risk prediction is most important, however, in medical applications and frequently the outcome measure is a right-censored time-to-event variable such as survival. Although several methods have been developed for survival risk prediction with high-dimensional genomic data, there has been little evaluation of the use of resampling techniques for the assessment of such models. Using real and simulated datasets, we compared several resampling techniques for their ability to estimate the accuracy of risk prediction models. Our study showed that accuracy estimates for popular resampling methods, such as sample splitting and leave-one-out cross validation (Loo CV), have a higher mean square error than for other methods. Moreover, the large variability of the split-sample and Loo CV may make the point estimates of accuracy obtained using these methods unreliable and hence should be interpreted carefully. A k-fold cross-validation with k = 5 or 10 was seen to provide a good balance between bias and variability for a wide range of data settings and should be more widely adopted in practice.
منابع مشابه
O-16: Comparison of Pre-Antral Follicle Culture Development during 2 Dimensional and 3 Dimensional Culture Systems
Background: Setting up an in vitro follicle culture system that resembles in vivo ovary condition has high value in research. Additionally, expression evaluation of folliculogenesis involved genes could lead us to the designing of better culture system. Materials and Methods: ovaries of 12-day-old female NMRI mice were removed, 100-130 μm pre-antral follicles were mechanically isolated from fre...
متن کاملNursing Students` Viewpoints on Challenges of Student Assessment in Clinical Settings: A Qualitative Study
Introduction: Student assessment in clinical settings is an important subject in nursing education. Reviewing students’ clinical skills, some problems put forward which manifest in students` complaints and frequent meetings between them and their instructors to discuss these problems. Despite some efforts in this area, it is still a major challenge for nursing students. In this regard, nursing ...
متن کاملEstimation of the Cardiovascular Risk Using World Health Organization/International Society of Hypertension (WHO/ISH) Risk Prediction Charts in a Rural Population of South India
Background World Health Organization/International Society of Hypertension (WHO/ISH) charts have been employed to predict the risk of cardiovascular outcome in heterogeneous settings. The aim of this research is to assess the prevalence of Cardiovascular Disease (CVD) risk factors and to estimate the cardiovascular risk among adults aged >40 years, utilizing the risk charts alone, and by the ad...
متن کاملروشهای بازنمونهگیری بوت استرپ و جک نایف در تحلیل بقای بیماران مبتلا به تالاسمی ماژور
Background and Objectives: A small sample size can influence the results of statistical analysis. A reduction in the sample size may happen due to different reasons, such as loss of information, i.e. existing missing value in some variables. This study aimed to apply bootstrap and jackknife resampling methods in survival analysis of thalassemia major patients. Methods: In this historical coh...
متن کاملEmpirical Likelihood Approach and its Application on Survival Analysis
A number of nonparametric methods exist when studying the population and its parameters in the situation when the distribution is unknown. Some of them such as "resampling bootstrap method" are based on resampling from an initial sample. In this article empirical likelihood approach is introduced as a nonparametric method for more efficient use of auxiliary information to construct...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Statistics in medicine
دوره 30 6 شماره
صفحات -
تاریخ انتشار 2011